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Abstract. We study a general preferential attachment and Polya's urn model. At each step 
a new vertex is introduced, which can be connected to at most one existing vertex. If it 
is disconnected, it becomes a pioneer vertex. Given that it is not disconnected, it joins an 
existing pioneer vertex with probability proportional to a function of the degree of that vertex. 
This function is allowed to be vertex-dependent, and is called the reinforcement function. We 
prove that there can be at most three phases in this model, depending on the behavior of 
the reinforcement function. Consider the set whose elements are the vertices with cardinality 
tending a.s. to infinity. We prove that this set either is empty, or it has exactly one element, 
or it contains all the pioneer vertices. Ivloreover, we describe the phase transition in the case 
where the reinforcement function is the same for all vertices. Our results are general, and in 
particular we are not assuming monotonicity of the reinforcement function. 

Finally, consider the regime where exactly one vertex has a degree diverging to infinity. We 
give a lower bound for the probability that a given vertex ends up being the leading one, i.e. 
its degree diverges to infinity. Our proofs rely on a generalization of the Rubin construction 
given for edge-reinforced random walks, and on a Brownian motion embedding. 
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1 Introduction 

1.1 Setting and motivation 

We study the following model. Given finitely many classes (or groups) each containing a 
given initial number of members, new members arrive one at a time. For each new member 
arriving at time n, with probability s^ > we create a new class in which we place the 
member; with probability 1 — s„, we place the member in an existing class. We assume that 
each existing class attracts new members with probability proportional to a certain positive 
function of the cardinality of the group, called the reinforcement or weight scheme f. If the 
groups are allowed to have different reinforcement schemes, then we show that looking at the 
asymptotics as time tends to infinity we have exactly three different regimes: one group is 
infinite and all the others are finite; all groups are infinite; all groups are finite. Our main 
result, Theorem [131 shows that in the first regime the process will eventually create a unique 
infinite group: this happens when each group is reinforced quite a bit, but not too much 
with respect to the other groups. In the second regime, the cardinality of each group goes 
to infinity. Finally, in the last regime, all the groups will be finite; what happens is that the 
process creates various peaks: in the beginning one group dominates the others, but sooner 
or later another group will start dominating, and this change happens infinitely many times. 
In this way, no group dominates definitively the other groups. This is a kind of 'there is 
always a faster gun' principle. 

Our model is a generalization of two models from two different classes: one model from 
the class of preferential attachment models, as introduced in [13j and in [17], and one model 
from the class of reinforcement processes, as introduced in [5]. 

The first main model we are generalizing was introduced and studied independently in [T3] 
and in [T7] , and later studied in more detail in [21] and [26] . This model is part of the class of 
preferential attachment models, which are models of growing networks, and which were first 
proposed in the highly-infiuential papers [1] and [2]. In [1] new vertices arrive at the network 
one at a time and send a fixed number m of edges to already existing vertices; the probability 
that a new vertex is linked to a given existing vertex is proportional to the in-degree of the 
respective existing vertex. Here, the in-degree of a vertex is the number of children of that 
vertex. 

The model studied in [13], [17], [21] and [26] is as follows: consider a model of an evolving 
network in which new vertices arrive one at a time, each connecting by an edge to a previously 
existing vertex with a probability proportional to a function / of the existing vertex's in- 
degree. This function / is called attachment rule, or weight function, and it determines the 
existence of two main different regimes. The first regime corresponds to /(j) = j + 1 and 
it was studied in [1], [2] and [26]; the second regime corresponds to for 7 < 1 and it was 
studied in [26]. The third regime corresponds to /(j) = {j + 1)^ for 7 > 1, and it was 
studied in [21]. In the first two regimes, it is shown that the degrees of all vertices grow to 
infinity; in the third regime there is a second phase as one vertex eventually dominates all 



other vertices. In the first regime, which is a generahzation of Polya urn, the urn process is 
exchangeable. The exchangeability disappears when non-linear reinforcement is introduced 
(see [13). (For more results on preferential attachment models, see the survey [3].) 

Preferential attachment models have been motivated by real-life problems, especially in 
regards to network and internet applications. One important example of growing networks 
is the World Wide Web, in which the more popular a page (or vertex) is the more hits it 
receives; a similar principle applies to social interaction or to citation networks. Another 
example is the one of users of a software program who can report bugs on a website. Bugs 
with the highest number of requests get priority to be fixed. If the user cannot find an 
existing report of the bug, they can create a new report. However, it could be that there are 
duplicate reports, in which case the number of requests is split between the reports, making 
it less likely that the bug the user found will get fixed. Since bugs that have more requests 
appear higher up the search results, the user is more likely to add a request to an existing 
report than to a new one. 

This can be explained by the fact that such networks are built dynamically and that new 
vertices prefer to attach themselves to existing popular vertices with high in-degree rather 
than to existing unpopular vertices with low in-degree. 

The second main model we are generalizing is studied in [5], [20] and |28j . It is known as 
the generalized Polya's urn process, it belongs to the class of reinforcement processes, and can 
be described as follows. Given finitely many bins each containing one ball, new balls arrive 
one at a time. For each new ball, with probability p > we create a new bin in which we place 
the ball; with probability 1—p, we place the ball in an existing bin. The probability that the 
ball is placed in an already existing bin is proportional to /(j) = j'^ , where j is the number 
of balls in that bin. The case with p = and 7 = 1 is the well-known Polya's urn problem. 
For p = and 7 > no new bins are created, and the process is called a finite Polya process 
with exponent 7. If p > then the process is called an infinite Polya process. Similarly to 
the preferential attachment models, for generalized Polya urn processes with f{j) = p , it is 
known that for 7 < 1 the number of balls in all bins eventually grows to infinity, whereas for 
7 > 1 one bin eventually comes to dominate all other bins. (A detailed review of a number of 
other interesting results on Polya's urn processes and on reinforcement processes in general 
is provided in the survey [22j.) 

The generalized Polya's urn process has applications to many areas. We briefly mention 
one such application to biology; for an extensive overview of other applications of generalized 
Polya urn processes to reinforced random walks, statistics, computer science, clinical trials, 
biology, pshychology and economics, see for example Chapter 4 in [28]. 

The generalized Polya's urn process with p = is used in [9J and [27] to study a real-life 
application; the reinforcement scheme used in these papers is set to f{j) = p , with 7 > 0, 
and real-life data are compared against different values of 7 and initial configurations. More 
precisely, the authors study a colony of ants, which explores a chemically unmarked territory 
randomly, starting from the nest. The exploration is done on a fixed number k of paths of 



various lengths. Each ant passes along one of the k paths leaves a pheromone mark and in 
this way infuences the following ant's decision in choosing a particular path. This decision is 
also influenced by whether the paths of various lengths are discovered at the same time, or 
whether they are discovered at different times. In the real-life experiment it is noticed in the 
case of paths of equal lengths that, after initial fluctuations, one of the paths becomes more 
or less completely preferred to the others. 

We will show in our paper that the above two models, belonging to these two different 
areas, are in fact closely related because they are both special cases of our much more general 
model. The first of our results, Theorem II. H proved for our general model, unifies the 
two above-described phase transition results for a very general class of weight functions /; 
the result holds in particular both for preferential attachment processes and for generelized 
Polya's urn processes. It is worth noting that our condition on the weight function is much 
weaker than all previously-proved results for the models we generalize. Moreover, in our 
main result. Theorem 11.41 we show, under no assumptions on the weight function, that we 
can have only three possible phases; in the third phase, all groups (respectively vertices, bins) 
stay finite as time tends to infinity. To the best of our knowledge, this is the first time when a 
third regime as described in our Theorem 11.41 has been proved for any model of preferential 
attachment or Polya's urn type. In the case of weight functions / which give rise to the 
second phase, we devise in our Theorem 11.61 and respectively in Corollary 11.71 a test for 
obtaining an upper bound, and respectively a lower bound, on the probability that a given 
group ends up being dominant. 

The motivation for our model comes from the class of species sampling sequences, to which 
class our model belongs. Species sampling sequences are models for exchangeble sequences 
(Xn) with a prediction rule, that is, a formula for the conditional distribution of Xn+i given 
Xi,Xi, . . . , Xn forn = 1, 2, . . . , n. More precisely, given the first n terms of the sequence (X„), 
Xn+i equals the i-th distinct value observed so far with probability p^+i^j, for i = 1, 2 ... n, 
and otherwise Xn+i is a new value with distribution u for some probability measure v. 
Species sampling sequences were first introduced and studied in [23], [23] and are now used 
extensively in Bayesian nonparametric statistics. (See for example [H], |16j or [18' for more 
on species sampling sequences or for their applications to statistics.) 

We next introduce precisely our model. 

1.2 The main model 

We consider the following model where at each step a new vertex and at most one new edge 
appear according to the following rules. The probability that the new vertex is disconnected 
is positive and may change in time. When a vertex is disconnected from the existing ones, 
it becomes a pioneer vertex. We label the pioneer vertices in order of appearance. Given 
that the new vertex is connected to an existing one, the latter is chosen with probability 
proportional to a reinforcement scheme of its degree. The graph formed with this procedure 



is the union of trees. Each tree has a pioneer vertex as a root. The tree with root i observed 
at time n, is caUed the i-th. group (or z-th component) by time n. 

More formally, fix a collection of positive functions /^ : N — )• M"*" with /^(O) = and fkii) > 
for all i, A; > 1, and a sequence {s„} which takes values in [0, 1]. Set ^i(l) = 1 and Aj{l) = 
for all j > 2. Set Li = 1. We define the random variables Ai{n + 1) and Ln+i recursively as 
follows 



'{Ak{n + l) = Ak{n) + l\Ln,{Aj{n)Mthj£N}) = (l - ^n) /'^ ', ' , , for i < L 

T " 
P(^L„+i(n + 1) = 1 I L„, {^j(n), with j G N}) = s^, 



Ef=i/.(^N) 



while Aj{n + 1) = for all j > L„ + 1. Moreover, 

L„+i =' max{j > 1 : ^j(n + 1) > 1}. 

Notice that Ai{n + 1) — Ai{n) £ {0, 1} and exactly for one index i this difference equals 1. 
The random variable Ai{n) is the cardinality of the i-th group by time n. We call the process 
{Ai{n),i, n > 1} a generalized attachment model whose parameters are the sequence {s„} and 
the reinforcement functions {fk}, abbreviated with GAM({/fc}, {s„}). We emphasize the fact 
that we do not make any assumptions on the update functions {fk}, other than positiveness, 
and {sn} is allowed to be random. As shown in Theorem 11.41 some of our strongest results 
hold for a group-dependent deterministic reinforcement scheme {fj}, that is, where each 
group j follows its own reinforcement scheme fj, independently of the other groups. From 
the point of view of applications, this allows one to take into account the case where different 
groups have different update schemes, which is what would be expected in many real-life 
situations. We use the symbol GAM(/, {sn}) to denote a generalized attachment model 
where the update functions fk are equal to the positive function / for each A; > 1. 

We briefly discuss next the link of our work to the recent literature. The two main models 
that we generalize were studied in detail in the particular case with reinforcement scheme 
proportional to f{j) = p , where 7 > 0. 

Let us look first at the literature on preferential attachment models connected to our 
generalized attachment model. The preferential attachment model studied in [13], [T7], [2T] 
and [26] is just GAM(/, {sn}) for the particular case oisn = m{n)c/[ J2s=i f {As{'n'))+m{n)cj , 
where we denoted by m{n) < n the number of groups (respectively vertices) with no children 
at time n, and where c > 0. Then in the growing network, Aj{n) > 1 represents the 
in-degree at time n of existing vertex j with strictly positive in-degree, that is, vertex j 
has Aj{n) children. With probability /(^j(^))/( X]s=i /(^s('^) +'m{n)c\, a new arriving 
vertex attaches to an existing vertex j with strictly positive in-degree Aj{n); with probability 
m{n)c/[ X^^=i f{As{n)) + r?T,(n)c) , a new arriving vertex attaches to one of the existing m{n) 



vertices with in-degree. For the case where the reinforcement function / is hnear or super- 
hnear, {s„} is bounded away from 1, so we can apply our results to the case of preferential 
attachment models. 



In [2T] the authors look at the preferential attachment model with reinforcement scheme 
fiJ) — [J + ^V ^1 > 1) foi^ which they prove a similar result to our Theorem II. II bv using the 
original Rubin construction. In [3] and [19], respectively in [26], the authors give the limiting 
degree distribution for a wide range of linear, respectively sub-linear, weight functions. 

A different preferential attachment model was studied in [10] and [11]. In this model a 
new vertex arrives at each step and attaches to every existing vertex independently with a 
probability proportional to a concave weight function / of the existing vertex's degree. In [10] 
the authors prove in Theorem 1.5 the same type of phase transition as in our Theorem ll.lt 
and they study the degree distribution. In [TI] they study the existence of a giant component, 
that is, of a connected component containing a positive fraction of all vertices. 

We turn now to the literature on the generalized Polya's urn model. This model corresponds 
to GAM(/, {s„}) in the particular case with Sn = p for all n > hq, for some fixed no G N. In 
[3] the authors consider both the generalized Polya's urn model with p = 0, when the number 
of bins is fixed, and with p > 0, and they prove by combinatorics techniques a similar result 
to our Theorem 1 1.1 1 for the case of power functions. The case with p > and 7 < 1 is studied 
in [5] under two additional assumptions involving the power function /(j) = p , assumptions 
whose validity is left as an open problem in that paper. 

The generalized Polya's urn model with p = was also the main object of study in |20j 
and [28]. In pO] the author studied the case of two fixed bins under a number of technical 
assumptions on the function /, which exclude for example the (super)-exponential functions, 
and which assumptions are stated in Section 4 of that paper. Theorem 3.3.1 in [28] proves a 
result similar to our Theorem 11.11 i) for the case of a fixed number m of bins and under the 
assumption of monotonicity on the super- linear function /. 

Last, we provide below a definition of species sampling sequences and why GAM(/, {s„,}) 
is such a sequence. Consider a Polish space X and let //(•) be a diffuse probability measure 
on X, i.e. ^{{x}) = 0, for all x £ X. Denote with lyi the indicator function of the event A. 
A sequence of random variables Xn, with n > 1, on X which has the following distribution 

n 

F{Xn+i eB\Xi,...,Xn) = Y,Pn+iA{x,eB} + rn+iKB), (1.1) 

is called a species sampling sequence whenever Vn + "^iPn,! = 1, Tn^Pri,i > 0, and rn,Pn,i 
are Tn-i measurable, where Tn = a{Xi,X2, ■ ■ ■ ,Xn}- It corresponds to GAM(/, {sn}) for 
the case with pn^i = fiiAi{n)) / "^^I!^^ fs{As{n)) and r„ = Sn for all n > 1. In particular, 
the Blackwell-MacQueen urn scheme, also known as Chinese restaurant process, is a species 
sampling sequence with the choice s„+i = Pn+i,i = 1/(1 + '^); it corresponds to GAM(/, {sn}) 
with /(j) = 1/j and Sn = Tn = 1/n for all n > 1. 



In this paper we give a complete characterization of the existing phases for a very general 
class of update functions, for the case fj = f and for all non-negative random sequences 
(s„)„gN) with s„ < p < 1 for all n > 1. In particular we do not assume any monotonicity on 
/, and our only assumption on / is for Theorem 11.11 i). and it controls the oscillation of the 
reinforcement function. Moreover, we prove in Theorem 11.41 that for any group-dependent 
deterministic reinforcement scheme {fj}, where {fj} are only assumed to be positive, we can 
only have three possible phases. We prove the existence of a third phase, by an example. We 
emphasize the fact that exactly three phases are admitted for this model. 

1.3 Results 

The following are our inain results 

Theorem 1.1 Consider a GAM(f,{sn}). Suppose that Sn < p, for some p < 1 and all 
n> 1. 

i) If 

oo oo 

y < c«, and y^ exp -^ — c( min f{j)) > < oo Vc > 0, 

n=l I^^^ n=l "'-" 

then there will be, a.s., exactly one group whose cardinality tends to infinity, all the 
other groups being finite. 

a) The cardinality of each (created) group tends to infinity a.s. if and only if 






^^ fin) 



OO. 



Remark 1.2 If we remove the hypothesis that Sn is bounded away from, one, and suppose 
that J2'^=ii^ ~ ^n) < oOj then by Borel Cantelli's Lemma there exists a random time N such 
that for any time n > N a new group is formed. Hence the cardinality of each group will 
remain finite, and only finitely many groups will end up having a cardinality larger than 1. 
We do not study the case o/ lim sup„_^oo Sn = 1 cind X]5^i(l ~ ^n) = co- 

The following result is a corollary to the proof of Theorem 11.11 i) . It generalizes the results 
contained in [21] about the degree of vertices in the preferential attachment inodel. 



Corollary 1.3 If the hypothesis of Theorem \1.1\ i) hold, then lim„_>.oo ^i (™) > 1 for only 
finitely many i. 

The following theorem establishes that GAM({/j}, {sn}) can have only three possible phases. 
The Theorem holds true if the fj are random functions independent of Sn satisfying the 
conditions of the Theorem almost surely. 



Theorem 1.4 Consider a GAM({fj}, {sn})- Suppose that s„ < p < 1, for some p < 1 and 
all n > 1. 



^) If 



> < oo, for at least one created group j £ N, 



then there will be, a.s., at most one group whose cardinality tends to infinity, all the 
other groups being finite. 



ii) If 



oo ^ 

y „ . , = oo, for all created groups jG N, 



then either the cardinality of each (created) group tends to oo, a.s., or each of them will 
be eventually finite, a.s.. 

We show in Example 14.11 that for the cohection of update functions fj{n) = e^^ ~^"'', the 
cardinality of each group remains finite, a.s.. The third phase seems to arise only when for 
fixed n, j — )■ fj{n) is an unbounded sequence. 

The previous two theorems rely on a novel modification of a well-known tool used in 
reinforced random walk processes, the Rubin construction, which embeds GAM({/j}, {sn})- 
We believe that such a generalized Rubin construction as introduced in our paper could have 
wider applicability to other preferential attachment models. 

In the second part of the paper we are going to estimate the probability that a given group 
is the leading one. Our first result concerns a reinforced urn model . Consider an urn with k 
white balls and 1 red ball and with reinforcement scheme /. Then if we pick a ball at random, 
it is white with probability f{k)/{f{k) + /(I)), and red with probability f{l)/{f{k) + /(I)). 
Suppose that by the time of the n-th extraction we picked j white balls and n — j red ones. 
The probability to pick a white ball becomes f{k + j)/{f{k + j) + f{n + 1 — j)). We call the 
urn with these initial conditions and dynamics a reinforced urn model with parameters k and 
/ (abbreviated RUM(A:, /)). Denote by P''"' the probability measure referring to RUM(A;, /). 
We have the following estimate. 

Theorem 1.5 Fix any k>l and consider a RUM(k, f) with Y27Li ^/fU) < °°- ^^ have 



1 ^ f(f)P 
¥'-''^ (^only a finite number of white balls are picked^ < — I I TTiKW^ (l-^) 



fc-1 

~ + f{(^)Fk 



1=1 



where Fk = Ej^fc+i V/O')- 



The above theorem sheds deeper insight on the evolution oiJUJM.{k, f) and on Theorem ll.ll i): 
it shows that the leading side in the beginning has a great probability to stay the dominant 
side. As an example of the power of our bound, take /(j) = j^. In this case, a simple 
computation gives that 

1 vr 

P'*'(only a finite number of white balls are picked) < — exp ( — (A; — 1) H ). 

Hence for large initial weights k the white has an overwhelming chance to be the one 
with cardinality tending to infinity. The estimate in ()1.2p improves Theorem 3.6.2 in |28j . 
Theorem 11.51 should be also compared with Theorem 3 in [2Dj, which is proved under the 
technical assumptions on the update function / stated in Section 4 of that paper. Note also 
that the bound above is an improvement on the upper bound which could be obtained in 
(|1.2p by means of a similar reasoning to the one in Proposition 2.1 and Proposition 3.1 from 
[6]. The cause for this is that the lower /upper bounds in [6] are rough for large initial weights. 
This is one main reason why the methods there only work for finite graphs and not also for 
infinite graphs. Our proof is based on an embedding of RUM(fc, /) into Brownian motion, 
and gives robust estimates for all initial weights. 

Next we turn again to GAM(/, {sn}). Suppose that Yl'jLi^/fU) < °o. Theorem 11.11 i) 
guarantees the existence of a unique group whose cardinality goes to infinity. We call this 
the leading group. Denote by Lead the label of the leading group. In other words, Lead = j 
if and only if the leading group is the j-th one. Our goal is to test if a given group, which 
has a certain advantage on the others, is the leader. We start by giving an upper bound for 
the tail of Lead. 

We give the following construction of GAM{f,p). Suppose we have two sequences of 
random variables, 6„ and t{n), satisfying the following. The variables 6„ are i.i.d. Bernoulli 
with mean p, while the variables t{n) are described recursively. We define ^*(1) = 1, and 
A*{1) = for all i > 2. Moreover, set L^ = 1. Denote by J-"„ the cr-algebra generated by 
{(6j,t(i)), with i < riY Suppose we defined A*{n), which is J>i_i-measurable. The random 
variable t(n) can be chosen to have the following distribution 

P(t(n) = k I J-„_i) = {^^^k^""^^ . 
EskfsiAtin)) 

Moreover, we can choose t{n) to be independent of bi with i > n + 1. Denote by 

1*( 



L; =^ max{j >1: A*{n)>l}. We define 



A*{n + 1)=^ 



Vj>l; + i, 

1 if br,. = h 



-'n 



.4* (n) + %„)=,} if6„ = 0. 

) > !}■ 
distributed like the process {Ai{n), i,n G N} described in section [L2l At time n, bn will 



Finally, let L;^/= max{j > 1: A*{n + 1) > 1}. We have that {A*{n), i,n E N} is 



determine if the new vertex is disconnected, and t(n) will determine to which of the existing 
vertices the new arrived will adhere if it is not disconnected. Notice that f (n) is defined also 
in the case that bn = i, i.e., in the case that the new vertex is disconnected. We denote 

def 

by ^1 = and ^i = inf{n > ^j_i : bn = I}- In words, ^j is the time when the i-th group is 
formed. We say that the i-th group is generated by the u-th group if t{^i) = u, i.e. if we 
flipped the value of b^. into 0, then the new arrival would have joined the group u. In this 
case we say that u is the parent of i. Notice that there exists exactly one parent for each 
integer different from one. We build a random tree Q, whose root is one, joining each integer 
to its parent. We say that a vertex is at level n if its distance from the root is n. Denote by 
gn the vertices at level n. Let Gn = ^j>ngj- We have 

Theorem 1.6 Suppose that the assumptions of Theorem \1.1\ i) hold. Then 

P(Lead G G„) < inf fm^e-^^^'''^^)" + r"" + Ci exp { - MCa}! , 

r,_A/>l L J 

where the sequence Cn{r,M) — )• oo as n ^ oo, for fixed value of r > 1 and M > 1, and 
m,Ci,C2 > 0. The quantities Ci, C2 and m are computable. The functions Cn{r,M) are 
computable for fixed values of r and M. 

The following result is a direct consequence of Theorems 11.61 and 11.51 

Corollary 1.7 Suppose that the assumption of Theorem \1.1\ i) hold. Then 

P(Lead= 1) >l-{±lYl^m\ if pe--*«)+.--C,exp{-MC,}" 

k=i e=i ■' ^ ' '^ ' - 

where the quantities Cn{r,M), m and Ci and C2 are the same as Theorem \l.(A 

The rest of the paper is structured as follows: in Section [2] we introduce our generalized 
Rubin construction and give the proof of Theorem 11.11 (i) , in Section [3] we give the proof of 
Theorem ll.il (ii). In Section H] we prove our main result Theorem 11.41 and present an example 
where the third phase occurs, in Section [5] we introduce our Brownian motion embedding 
and provide the proof of Theorem 11.51 In Section [6] we give the proofs of Theorem 1 1 . 6 1 and of 
Corollarv ll.71 Finally, in the Appendix we give a brief introduction to the Rubin construction, 
as introduced in [7]. 

2 Proof of Theorem 11.11 i) 

We introduce a modified version of the Rubin construction which fits our model. For a 
detailed explanation of the original Rubin construction, see for example |6j and \[l\. 



Fix a parameter p < 1. We first focus on tlie case s„ = p < 1, i.e. GAM(/,p), tlien we 
extend to tlie more general case Sn ^ p using a coupling. For any set A C M^, let 

A[n] =inf{x: #{An [0,x]) > n + 1}, 

where the infimum of an empty set is oo. In words, A[n] is the n + 1-th element of A, ordered 
from the smallest to the largest. For example, if A = {2, 8, 6, 9}, then A[0] = 2 and A[l] = 6, 
^[5] = oo. Notice also that for the example A = {1/j ■ j > 1}, is not possible to identify the 
n + 1-th element. In fact, in this case, we have that A[n] = for all n > 0. 

Notice that A[n] is always a non-decreasing sequence, hence lim„_>oo^['i-] exists, possibly 
infinite. For each i G N, let {Wn\n > 1} be a sequence of independent exponential 1) 
random variables, with n G N. Moreover let {Rn ,n > 1} be a sequence of i.i.d. Bernoulli 
such that P(i?n' = 1) = P- We are going to use these sequences to generate a GAM(/,p). 
The Bernoullis will be used to create new groups, while the exponentials play a central role in 
the allocation of new individuals into existing groups. We are assuming that all the variables 
involved are independent of each other. Set iVm,(l) = 1, for all m > 1. Then, for n > 2, let 

N„,{n) =^ l + #{j: j <n-l such that i?J"> = 0}, 



='1 



{0}u|y , \^. :n>l} C 



def 

In words, for each m > 1, the processes Nj„ = {Nm[n),n > 1} are independent processes 
with the property that Nm{n) — 1 are distributed like binomial with parameters n — 1 and 
1 — p, while Hi is a random subset of M"*" composed by and all the partial sums of the 
sequence {VFj //(A'^i(i)), with z > l}. To each element Hi we associate a corresponding 

Bernoulli as follows. Let (71 : Hi — )■ {0, 1} be a random function defined by (7i(Hi[n]) = Rn ■ 
The elements in Hi with corresponding Bernoulli equal to one, are used to generate new 
groups for GAM(/,p). The other ones will potentially belong to the first group and will be 
labelled one. We will clarify the last sentence at the end of the construction. Define 

Hi 1i/ {0} U I y ; %,, : n > 1 and i?(,i' = Ol, 

i.e., Hi is composed of {0} and all the points in Hi \ {0} with Bernoulli equal to 0. These 
are the points which do not generate other groups. We label the points in Hi with 1. Set 
Ti = and define 

r2 =^inf{n>l: R^l^ = 1}. 



The random variable T2 is the time when the second group is formed. Given T2, let 

n fimi)) 



" TI/{2) 
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and 

Ha = \ Hi[r2] + V , \ ,, : either n = or both n > 1 and i?<f> = L 

We label the elements of H2 using 2. Define the function (^2:^2 — ^ {0, 1} as follows. If 
S2[n] = Hi[j], for some j G N, then 5^2 (S2M) = -Rj-^'- The latter is well-defined because 
all the elements of Hi are a.s. distinct. If H2[ra] = (H2 \ Hi)[j] for some j G N, then 
g2{^2[n\) = R^P ■ Notice that Hi and H2 are disjoint and their union is a proper subset of H2. 
Let us describe in words the variables defined so far. The reinforcement plays no role up to 
time T2- The latter random variable is geometrically distributed with mean 1/p. At time r2, 
the first group has cardinality T2, because we count also the point 0, and a second group is 
formed. The random point H2[t2] is labelled 2, in fact it belongs to H2, and it is the smallest 
point belonging to this random set. The next point on the line, i.e. H2[t2 + 1] can have label 
1, 2 or no label at this stage. If the latter happens, we label this point with 3. If it belongs to 
Hi, respectively H2, its label will be 1, respectively 2. Notice that by the definition of these 
sets, if H2[t2 + 1] G Hi U H2 then 52(^2 [''"2 + 1]) must be equal to zero. On the other hand, 
in the case that g'2(H2[r2 + 1]) =1 then a new group is formed, which is labelled 3. The 
probability that this happens is p. Next we want to compute the probability that H2[t2 + 1] 
has label 1. We have the following equality 

{H2[r2 + 1] G Hi} = {Hi[t2 + 1] - Hi^] < H2[l] - Hi[r2]} n [g2{^i[T2 + 1]) = o}. (2.1) 

Note that H2[t2] = Hi[r2]. Given r2, the two events appearing on the right-hand side of 
(|2.ip are independent, because the first one depends on the exponentials while the second is 
determined by the Bernoullis. The probability of the second event, conditionally on r2, is 
1 — p. If the random variable H2[t2 -|- 1] was labelled 1, then it would belong to Hi and would 
be equal to 

Si[t2 + 1] = E ^''V/(A^iW) = SiN + <^i//(iVi(r2 + 1)). 
1=1 

If H2[r2 + 1] was labelled 2, then it would belong to H2 and would be equal to H2[l] = 
H2[T2] + H^fV/(l)- Hence 



i[t2 + 1] = H2[r2] -Fmin 



'-'2[t2\ +min 



<^i w['^ 



/(iVi(r2 + l))7(l) 

/N'/(i) 



where we used Ni{t2 + 1) = T2- This last equality comes from the fact that among Rf, with 
^ ^ 1"2) the only Bernoulli taking value one is -R7-2 • As Ni{t2 + 1) equals one plus the number 
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of zeroes among the first T2 Bernoulli, it is equal to T2. The first event on the right-hand 
side of (I2.ip can be rewritten as 

Given T2, the random variable W^rj //("^a) is exponentially distributed with mean l//(r2). 
By a simple integration, we can argue that the probability that, among two independent 
exponentials, a given one is the smallest is equal to its parameter divided by the sum of the 
parameters. Hence the probability of the event in ()2.2p . conditionally on T2, is /(T2)/(/(r2) + 
/(I)). The probability of the event described in (|2.ip . conditionally on r2, is 

(i-P)- ^<-' 



7(r2) + /(l) 



We infer that the conditional probability that H2[r2 + 1] is labelled 2 is (1 — p) f (l) / {f {T2) + 
/(I)). This is consistent with what happens in GAM(/,p). 

Suppose we defined [T2,'Si,Ei,gi, . . .Tm-i,^m-i,^m~i,gm~i)- Define 

Tm = inf{n > Tm-i: gm-i{'^m-i[n]) = 1}, 
i.e. the time when the m-th group is formed. Given t^ let 



^ def^ ,,r^ , , ,X^ W^ 



(m-l) 



H™-iu{H_,[r„]+g-^^^^:n>l}, (2.3) 

and 

H^ =^ j H„_i [t„] + V TTirrT^ : either n = or both n > 1 and iJ^T' = . (2.4) 

The elements of H^ are labelled m. Moreover let g^ be defined as follows. If there exists j 
such that Em[n] = H„_i[j] then gm{Em[n]) = gm-i {Em-iij]) ■ If H„[n] = (Hm\H„_i)[j] for 
some j, then set ^^(^^[n]) = i?^™'. 

Denote by H = IJ^i '^s- Each point x £ E belongs, a.s., to exactly one H^ for some s > 1, 

def ^ — 

i.e. E = IJ^i"s- In our construction, we label the point x with s if and only if x G Eg. 
Define the random function g: E ^ {0, 1} as follows. If E[n] = Ej[s] for some (a.s. unique) 
pair (i, s) G N^, then g(H[n]) = Rs\ Notice that E can be used to generate a generalized 
attachment model, as follows. Denote by 



Ai{n) = {j : j < n, E[j] has label i}. 

Then {Ai{n), with i,n > 1} is distributed like the process {Ai{n), with i,n > 1} introduced 
in subsection 11.21 To see this, suppose that in the set {E[i], with i < n} there are exactly 4 
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points labelled i, with X^i^i ^i — ^ fo^" some m G N satisfying also ii > 1 for all i G {1, . . . , m}. 
Given this, the probability that H[n + 1] is labelled m + 1, i.e., the probability that 5(5(71 + 1]) 
equals one, is exactly p. Given that H[n + 1] is not labelled m + 1, then the probability that 
it is labelled j, with j < m, is exactly 

^<^'' (2.5) 



where we used the menioryless property of the exponential random variables. In fact, using 
this property, given that H[n + 1] is not labelled m + 1, the random variable H[n + 1] — H[n] 
is distributed like the minimum of m exponentials with parameters f{is), for 1 < s < m. 
The probability that the j-th exponential is the minimum is given exactly by ()2.5p through 
a simple integration. Summarizing, given that in the set {H[i], with i < n} there are exactly 
ii points labelled i, with "^^i ii = n and X^™!^ ii < n for some m G N, the probability that 
S[n + 1] is labelled j, with j < m, is 



Define 

def °° W-"^'' 

and for any integer j > 1, set 

n 

E* "^ {h[t,] + Y, W^^^/f{N,{s)) : n > 0}. (2.7) 

s=l 

In the next result, we prove that xj^ is a.s. finite, for any m > 1. This, together with (|2.3p 
and ()2.4p . implies that x^ is an accumulation point for H^ and Hm- We say that a vertex 
u is generated by j if H[ru] G H*. Notice that each vertex (different from 1) is generated by 
exactly one other vertex. Our proof of Lemma 12.11 relies on the construction of a random 
tree T, built by connecting each vertex to its parent. Notice that this random tree shares 
the same distribution with Q, introduced before Theorem 11.61 Suppose that r^ = t. If we 
switched ^(^[i]) from 1 to 0, we would have that H[i] would have been a point of Hj, hence it 
would have had label j. Fix j, n G N. Notice that even if the Bernoulli associated to the point 
S^[n] equals 1, this point might not be able to generate a child in T using the exponentials 
and Bernoulli that have been defined so far. This is the case if #(H n [0, H* [n]]) = 00, when 
infinitely many vertices have already been generated by the time we reach H^[n] and all the 
(W,i , i?n ) have already been used. This is going to be an important point in the proof of 
Lemma [ 



Lemma 2.1 The random variables x^, with m > 1, are almost surely finite. 
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Proof. Fix m > 1. Set Zm{0) = and let Z„(i) = inf{n: Nm{n) = i}. Then Zm{i) - 
Zm{i — l)j with i > 1 are geometric (1 — p) and are independent of the W^ , with i > 1. If 
Zmi^ <k< Zm{i + 1), then /(iV^(A;)) = f{i). Hence 

oo Zm{i+'i-) — l w'™) 



+ E E 



oo Z„(j+1)-1 

-m-Et^ E -r 

As the series in the latter expression is composed by non-negative random variables, it is a.s. 
finite if its mean is finite. Its mean is exactly 

y^7-<oo. 2.8 

To see this, notice that Zrn{i), i > 1, is independent of W!- , j > 1, which implies 

^m(i + l)-l 



1 — p 

j = Zm{i) 

Moreover, we have that H[rm] is stochastically smaller than 

—^T7^yw['\ (2.9) 

This is because S [n] — S [re — 1] is stochastically smaller than an exponential random variable 
whose mean is smaller than l/(minj /(z)). Moreover, the random variable t^ is negative 
binomial with parameters m and p. This can be checked by induction, in fact ri is geomet- 
rically distributed with mean 1/p. Suppose this is true for Tm-i, then we have to wait an 
independent geometric(p) to create the next group. Combining this fact with (|2.9p we have 
that H[rm] < oo a.s.. This, together with ()2.8p implies the Lemma. □ 

In the next result we establish the link between the behavior of the generalized attachment 
model and the quantity infj x*. 

Lemma 2.2 The infimum infjX*, is a.s. attained, i.e. it is actually a minimum. The 
minimiser is a.s. unique. Moreover 

lim H[n] =infa;*, a.s.. (2.10) 
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Proof. We select a random subtree of T, denoted by 7i, as follows. The root of this tree is 
1 (i.e., it is identified with the first group). Given that a vertex j belongs to 7i, its offspring 
will be those vertices u such that 

E[tu] G H*, and H^il] < x*. (2.11) 

Recall that H[t„] = H„_i[t„] = H*[0]. We are going to prove the following statement. 

For any fixed M, only finitely many of the vertices n of 71 satisfy Sn[l] > M. (2.12) 

Before we prove ()2.12p we argue that this statement would imply the Lemma. We need only 
consider the vertices of 7i. In fact, if j is not a vertex of 71 then there exists a vertex u such 
that Hj[l] > X*, which implies that x*, > x*. Hence x* ^ infjX*. 

If ()2.12p holds, then for any M there are only finitely many vertices n in 71 such that 
X* < M. Hence, as each x* is a.s. finite, we have that inf j x* is actually a minimum. Next 
we prove that the minimizer is a.s. unique. To prove this last statement, we prove that for 
each i > j, we have that x* and x*- are a.s. different. To see this, notice that x* — H[rj] only 
depends on {Wn\Rn, with n > 1}. Hence x* — H[ri] is independent of x* — H[tj] which 
is determined by a disjoint collection of exponentials and Bernoullis. The probability that 
X* — H[rj] and x* — H[ri] are equal is 0, as they are continuous independent random variables. 
This is exactly the probability that x* = x"^- As the set of x*, i > 1, is countable, x* are all, 
a.s., distinct. 

Next we show that (|2.12p implies ()2.10p . As already mentioned, the sequence H[n] is a.s. 
non-decreasing, i.e. H[n + 1] > H[n], a.s.. Hence lim„_>.oo H[n] a.s. exists. Notice that for each 
i, X* is the limit of an increasing sequence taking values in H. To see this, notice that 

n ^,(^) 



^t ' 



by the definition of x*. Hence infinitely many points labelled i are smaller than x*, yielding 
#(Hn [0, X*]) = cxD. This implies that lim„_>.oo H[n] < x* for each i> I, i.e. lim„_!.oo S[n] < 
infjX*. Now we turn to the proof of the other inequality which implies (|2.10p . Fix e > 0. It 
is sufficient to prove that (|2.12p implies 

#{u: Eu[l] < inf X* - e) < oo. (2.13) 

i 

In fact, if (|2.1c{p holds, only finitely many u satisfy #(5^ Ci [O, infj x| — e] ) > 1. Denote the 
set of labels of these groups by B. For each element u of the finite set B, there are only 
finitely many points of H* which are smaller than inf j x* — e, for otherwise we would have 
X* < inf j xj — e which would yield a contradiction. Hence, for each element u oi B, the set 
H* n [ inf j X* — e] is finite. For each j ^ B we have that there exists a u G B such that 
Ej[0] G H*. This implies that 

Hn[0,infx*-e] = M H^ n [inf x* - e] , 

uGB 
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and the latter is a finite set. Notice that infj x* — e could take a negative value, but this is 
not a problem for our reasoning, as then the set appearing in (|2.13p would be empty, and 
there would be nothing to prove. Next, we prove that (|2.12|) implies (|2.13|) . Fix a vertex 
u of 7i. Denote by m(0) = l,u{l),u{2), . . . ,u{n) the ancestors of u in 71, i.e. the vertices 
lying on the unique self-avoiding path connecting u to the root 1. Notice that we do not 
consider u ancestor of itself. If u satisfies S„[l] < infj a;* — e, then u belongs to 71. In fact, 
S„[l] > H*(.-)[l] while infj x* - e < a;*^.,^), for alH < n + 1. Hence, H*(.)[l] < x*(._-^), where 
we set u{n + 1) = u. As we are assuming that ()2.12p holds, the random tree 71 has only 
finitely many vertices j satisfying Hj[l] > M. As infj x* < oo, a.s., we have that (|2.13|) holds. 

Next, we are going to prove (j2.12p . For any vertex j in 71, denote by aj the number of 
its offspring. Notice that the aj are neither independent nor identically distributed and 71 is 
not Galton- Watson tree. To see this, fix j,n > 1. If there is an infinite number of elements 
of H to the left of H|[n], i.e. 

#(Hn[0,H*[n]])=oo, (2.14) 

then already infinitely many groups have been created. Hence H*[0] < H*[n] for all u G N. 
This implies that Hj[n] cannot generate any new group in 71 using the exponentials and 
Bernoullis defined so far, because they have already been used. To overcome this problem, 
we create a new tree, larger than 71, by introducing new random variables which allow also 
the observations H*[n] satisfying (j2.14p to create a new group. To this end, we should attach 
to each of these observations a new sequence of independent exponentials and independent 
Bernoullis. For example, if H*[n] satisfies ()2.14p . and the associated Bernoulli equals one, 
a new group, that we label v, is created (notice that we cannot use any of the integers 
as a label, because they are already all taken). In this case, we set H,y[0] = H^[n]. We 
denote the associated sequence of i.i.d. exponentials with mean 1 by Wn\ and let i?*"' be 
the Bernoulli associated to group v. We define S* and Hj^ using these random variables, 
as we did in ()2.7p and ()2.4p . If the group u satisfies the second requirement in ()2.1ip . i.e. 
W^ /f{l) < X* — H*[0], then u belongs to the new tree 72 that we are going to define. 
But then, we would have to allow that u is able to generate groups as well, in the same 
fashion. This approach would require that we introduce new sequences of exponentials and 
Bernoullis, and the notation would be quite awkward. Hence we prefer a different approach. 
Before we proceed in a formal description of 72 , notice that for this tree the number of 
offspring per vertex are independent and identically distributed. In fact, on the set {v is 
generated by j}, we have that E^[0] ^ Sj[0], because the latter belongs to Ej. This implies 
that H,y[0] > H*[l]. Hence x* — H,y[0] is independent of H|[l]. In fact, the former is determined 
by the exponentials and Bernoulli attached to j excluding W^ , while S^[l] is determined 
by Wl and exponentials and Bernoullis attached to vertices different from j. Moreover, 
analyzing the event {Wl"^ /f{l) < x* — H*[0]}, one can easily argue that it does not depend 
on the exponentials and Bernoullis attached to vertices different from u and j. Summarizing, 
the number of offspring of j in this new tree depends only 

• on the exponentials attached to j, with the exception of Wl^\ 
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• on the Bernoullis attached to j, 

• and on W^f" , ii u = H*[n] for some n and Rfi = 1- 

This imphes that the number of offspring per vertex are i.i.d.. 

Now we are ready to give a formal construction of Ti- Suppose that to each x £ E^ we 
associate an extra exponential random variable Qx- Let 

77/= #{x G El: ejfil) < xl-xand g{x) = 1}. 

The previous random variable counts also the n satysfying #(H n [0,HJ[n]]) = 00, hence 771 
is stochastically larger than ai for any i. Then the Galton- Watson tree T2 whose offspring 
distribution is the same as the one of ryi is stochastically larger than 71 . We assume that 72 
is built on the same probability space of 7i. In other words, we can assume, and we will, 
that 7i is a subtree of 72- Next we prove that the average number of offspring is bounded by 
a finite constant m. Define 

00 
^fc/= {<'//(!)< E W^i^V/(A^, («))}• (2.15) 

s=k+l 

Notice that we should have used different exponentials(l) instead of W^" , but the two share 
the same distribution and are independent of the right-hand side, and this notation is easier 
to handle. Of course, we are allowed to do that because we are interested only in estimating 
the probability of this event. 

We have that 

00 00 

E[r?,] < IE[Eln,J = Y.^{^^^j)- (2.16) 

fc=i fc=i 

In order to prove (j2.16p . notice that on the left-hand side we count the number of elements 
in H* with Bernoulli equal to 1, and which satisfy an extra condition. The right-hand side 
counts only those vertices which satisfy the extra condition. Hence we only need to prove 
that P(i7fc.j) is summable. Notice that P(r2fcj) is independent of j. 

def 

Set Ofc = mins>A; f{s) and define 

00 
r'i/{fceN:(l-p)2afc+i</(l)or ^ l//(^) > 0.05(1//(1))}. 

s=k+l 

The set V is finite. This is implied by the facts that a^ ^- 00 and YlT=k+i ^/fi^) ~^ 0- ■'^o^ 
/c G r we use the trivial bound ¥{Qk,j) ^ 1- In the following, we fix A: ^ F, i.e. we assume that 
k satisfies (1 - p)'^ak+i > /(I) and E^fc+i l//(s) < 0.05(1//(1)). We prove the following 
bound, 

IF'(f^fcj) < e"^^'"=+^ for/c^r, (2.17) 
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where Ci = 0.15(1 - pf {I / f {!)) . Denote by ^ = E^fc+i ^i"//(^^i(s)), and Z 
W^f 7/(1) . Set ^ = (1 - pfau+i. We have 



^[»"'-'i-nE(7#7)v-(i-rt 



s=k+l j=l •' ^ ' 



nr 



s=fc+l 



-(e/(/(.)(l-p))) 



<exp{30 ^ l//(s)}, 

s=fc+l 

where we used that 0/{{l — p)f{s)) < {1 — p) < 1 for s > k and our choice of 6, and the 
inequality 1 — x > e~^^', for x G (0, 1). Moreover 

^ J-^ + /(l)-' 

where we use that k ^ T (hence 9/ f(l) < 1), and the inequahty 1 + x > e'^'^^, stih valid for 
X G (0, 1). This implies that 

P(Jlfc,i) = ^{Yk - Z >0) = F^e^^^e-^^ > 1) 

oo 

<exp{-0(O.3//(l)-3 Y. l/fij))} (2.19) 

j=k+i 

<exp{-0(O.15//(l))}, 



where in the last step we used that X^^^j+i 1/ f{s) < 0.05(1//(1)) which holds because we 
picked k ^T. The last inequality in (|2.19|) proves (|2.17|) for our choice of 9. 

Hence 

oo 

E[rjj] < E[Y^ In,,,] < Yl 1 + J^e-^i'^'^+i =^m < oo, (2.20) 

k=i ker kfr 

where, for the finitess of m we used the second assumption in Theorem 11.11 i). and the fact 
that r is a finite set. 

For each vertex li in Ti, recall that we denote by H*[0] the time when this vertex was 
generated and by H* [n] = H* [0] + X]fci ^i"V/(-^«(i))- This is consistent with our definition 
given in (|2.7p . but now it is defined for indices which are not necessarily integers. Next we 
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prove that each vertex u at level n + 1 has a probability to satisfy S„ [1] < M which decreases 
faster than e"'^" for any c > 0. For any vertex u G Ti we denote by \u\ its distance from 
the root of the tree. Recall that the set of vertices at distance k from the root is called level 
k. Fix a large parameter M. A vertex u of Ti is good if the element which generates u 
is smaller than M. A path is a (possibly finite) sequence of vertices u{i),i > 1, such that 
u{i + 1) is generated by u{i). We say that a path connects vertex a to level n + 1 if the first 
element of the path is a and the last lies at level n + 1. We build the following random path 
u. We start from 1 = u{0) and if this vertex has at least one offspring in 72, we choose one 
at random assigning the same probability to each offspring. We denote its label as u{l). If 
u{l) has at least one offspring, we choose one of them at random and denote its label by u{2). 
We follow this procedure until we either reach level n + 1 or find a vertex with no offspring. 
The event {the path u connects 1 to a vertex at level n + 1} equals the event that each of 
the u{i) has at least one offspring. Hence 

n 

{the path u connects 1 to a vertex at level n + l} = || {tJuU) > l}. 

1=0 

Notice that each event {r/„(j) > l} is independent of H*/^_-^Jl] and is independent of each 
WJf with £ < u{i — 1), and k > 1. Moreover the events {7?u(i) > l} are independent. Define 



j=i 
Fix k > 1. We have 



n 



^{u{n + 1) is good | the path u connects 1 to a vertex at level n + 1) 

n 

— ^\ Z-1 '^u(i)[M ~ '^u(i)M — ^ I the path u connects 1 to a vertex at level n + 1 



1=1 



\ ^ ^7* 



i=l 



i=l j=l 

n n 

< IP^ E (^«»[l] - Ki^)my{v.,.,<k} <M\f] {^„(,) > 1} n ^{n,k)) + 

1=1 i=l 



+ P(vl/^(n,fe)| f|{^«»>l 



i=l 

(2.21) 
In the last step we used that for any triplet of events A, B, C we have 

P(A \B)<F{A\ BnC) + FiC I B). 
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Next, we bound the last probability in (|2.2ip . 

n 

[T'{n,k)\ n{^«W>l}^ 

n 

^{Y. ^{'^"W ^ ^} ^ 0.5n I f|{r?„(,) > 1}) (2.22) 

71 n 

^]1{77„(,) > fc} > 0.5n I f|{7?„(,) > 1}' 



i=l i=l 

Let ^„(i), i < n, be i.i.d. random variables taking values in N, with distribution 

niu(i) >k)= P(r?^,) > k I 77„(,) > 1) = ^^"^""^^ ; ^1 =^ qu, for A; > 1. 

The sequence qk is independent oiu{i) because the random variables rji are i.i.d.. Moreover, as 
the 77„(j) are independent, ^11=1 ^{Vu{i) > k} is, conditionally on nf^;^{r7„(j) > 1}, binomially 
distributed with mean nq/.. If X is a binomial with parameters {n,q) then 

nX > 0.5n) < exp{ - (ll„(l) + -^l„-^)„}, (2.23) 

by a simple exponential bound (see, for example [8] pages 27 and 35). 
Fix r > 1. We can choose K* such that gx;? < 1/2 and 

^llUn(i) > i^;} > 0.5n) < exp|-(-^ln(-^) + -— i r In ^rr-^ r)n| < ^^, 

■fr^ / I '2g/^. 2gK; 2(1 -^i^.) 2{1 -qK*)' J (rm)" 

where m has been defined in (|2.20p . We can choose such K* because 

limfc_i.oo(l/(2Q'fc)) ln(l/(2gfc)) = oo. Notice that for any k > K*, we have qk < qx* < 1/2. 
Moreover if A; > K* then 

r,ll 1 1 .-. 1 

""""P 1^2^ ""^2^^ ^ 2(1 -gfc) "" 2(1-9^)'''"/ - (rm)"- 

This fact is due to the monotonicity of q^ and the convexity of the function 2x ln(2a;) + 2(1 — 
x) In 2(1 — x), for x G (0, 1), and the fact that this function attains its minimum at 1/2. Next, 
let (ej) be a sequence of i.i.d. random variables with distribution 

oo 

P(ei < x) = p(PFf //(I) < X I lyf < Yl Wi'^/fi^iit))) ■ 

t=K*+l 

In words, ej is distributed like an exponential with mean 1//(1) conditioned to be smaller 
than Yk* ■ We claim that the first probability in the last equation of ()2.2ip is smaller or equal 
to 

[O.SnJ 

P( Yl ei<M). (2.24) 
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To see this, notice that by a simple exchangeabihty argument we have that 

n n 

n L0-5nJ 

[O.SnJ [0.5nJ 

4=1 i=l 

Again, notice that the events {1 < rjuU) ^ ^rli with i < n + 1, are independent. Given 
{1 < ?7u{j) < K*}, the random variable H*/^Jl] — S*.^JO] is stochastically larger than e^, as 

X^^^j VKs //(-^i('5)) is a.s. decreasing in k. This proves the relationship between ()2.24p and 
the first probability in the last equation of (|2.2ip . Next a simple exponential bound, which 
uses the fact that Cj are independent, yields 



L0.5nJ ^ [0.5nJ 

p, < 

0.5M 

L0.5nJ 



^^ ^ ^0.5n -^ - 0.5- 



< exp{-c„(r, M)n}, 

where Cn{r, M) — >■ oo as n — )• oo. For each n, Cn{r, M) is the Fenchel-Legendre transform (i.e. 
we minimize the exponent on 9) of ej in the point q-^m- Hence, the number of good vertices 
in 7i at level n is smaller or equal to 

m'^(exp{-c„(r,M)n} + -^). (2.25) 

Hence only finitely many vertices in Ti are good. This implies that only finitely many vertices 
in 7i are good, and this, in turn, implies ()2.12p . □ 

Proof of Theorem 11.11 i). First suppose that Sn = p < 1. The minimum of inf j x| is a.s. 
unique, and we denote it by J* . By Lemma 12.21 linir,^^ ^[n] = x}, , hence the cardinality of 
group J* tends to infinity, while the cardinality of each of the other groups is finite. 

Now we reason for general s„ < p, using a simple coupling. Let {Si,i > 1} be a sequence 
of independent Bernoullis with F{Si = 1) = Si/p = 1 — F{Si = 0). We use these random 
variables to relabel the points in H as follows. If S*! = then we set 0i = H2[t2] U H \ H2. 

^ def 

If 5"! = 1 then 0i = E. Define T3 = inf{n > r2 : g{@i[n]) = 1}. Suppose we have defined 
@m-i and Tj, for i < m. On the event {X^i^T '^i ~ ^}' ^^ ^m = 0, respectively Sm = 1, 

^ ~ ^ def 

set 9m = Hfc+i[rfc+i] U Qm-i \ Sfc+i, respectively 6^ = ©m-i- We set r^+i = inf{n > 
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Tm'- g{Qm[n]) = 1}. Let = PljQj. The process 0[n], with n > 1 is a GAM(/, {s„,}). Let 
i^in) = ^?=i Sj. Denote by h{i) = inf{n: K(n) = i}. This imphes that the i-th group in 

def ~ 

is the /i(i)-th group in H. Let Ui = '^h{i)i ^^^ 

<'=4w (2-26) 

This imphes that m.ij{u* : j > 1} is actually a minimum and has a unique minimizer. Fol- 
lowing the same reasoning given in the previous paragraph we conclude that the only group 
whose cardinality grows to infinity is K*. D 

Proof of Corollary 11.31 We first assume that s„ = p. For any i, denote by E{i) the set 
of groups which are generated by i. In virtue of ()2.20p . we have that 

def ~ 

V{u) = {Hj[l] < x* for only finitely many i G -E'(n)} holds a.s.. (2.27) 

Notice that for u which is not a vertex of 71 we have that H„[l] > inf j x* = lim„_j.oo "[?i-]- 
Hence, we do not have to consider such u. Recall the definition of Gm given before The- 
orem 11.61 As for each A^, there are only finitely many good vertices in 7i, we get 
limTv-j-oo IF'(Lead G Gn) = 0. Combining the latter limit with (|2.27p we have that 

P( lim Au{n) > 1 for only finitely many u) 

= P(Hu[l] < inf X* for only finitely many u) 

i 

= lim F({Hu[l] < inf X* for only finitely many u} D {Lead ^ Gn}) 

TV— >-oo i 

> lim Pf n y(u) n {Lead ^ Gtv} 
= lim P(Lead ^ Gn) = 1- 

For the general case s„ < p, apply the same coupling we used at the end of the previous 
proof. □ 



3 Proof of Theorem 11.11 ii) 

We first deal with the case s„ = p. Repeat the construction given in the proof of Theorem ll.il 
i), under the hypothesis of Theorem ll.ll ii). Recall the definition of S*, Eu and x*. Recall 
also the definition of 7~. The random variables x* , for u > 1, are a.s. infinite, because the 
infinite sum of independent exponentials is finite if and only if its mean is finite. We prove 
next that for any fixed Af > 0, 

liminfH„[l] > M, a.s.. (3.1) 
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Fix a vertex Un of T, and denote by Ui, with i < n — 1 its ancestors: i.e. H^^. [tuj] G ^m „i5 for 
all j < n. Then Hu^[l] is stochastically larger than a sum of n — 1 i.i.d. exponentials with 
parameter /(I). Hence lim„_>oo Sm„ [1] = oo, a.s.. Now notice that H* n [0,M] is a.s. finite 
for each s > 1. Hence, as u grows to infinity, the number of its ancestors grows to infinity, 
proving (j3.ip . Since it is easy to adapt the above reasoning to the case Sn < p, we will leave 
this task to the reader. 

4 Proof of Theorem 11.41 

Proof of Theorem 11.41 i). We first analyze the case s„ = p. We build a similar con- 
struction as the one given in the proof of Theorem II. H the only difference being that we 
place fi instead of /. We leave to the reader to check that this construction embeds our 
GAM{{fj},p). In this setting 



Xa — ^i Ti 



^^,Mmj))- 



Notice that x* is a.s. finite if and only if Xl^i ^/ fii^) is finite. Hence, we do not exclude 
that X* = oo, a.s., but we know that 

there exists at least one j for which x* < c«, a.s.. (4-1) 

Denote by y* the smallest accumulation point of H. This minimum accumulation point 
exists because the set of accumulation points of H is closed, and the set H is a subset of M"^. 
Moreover y* is a.s. finite because of (|4.1|) . If y* < x* for all i > 1, then 

lim E[n] < x*, for all i > 1. (4.2) 

We need to prove (|4.2|) only for the case x* < oo, because for the other cases the result is 
implied by the fact that y* is an accumulation point which is a.s. finite and #Hn [0, y* + e] = 

def 

OO. Assume that x* is a.s. finite and notice that for fixed i, as y* < x*, then 6i = {x*+y*)/2 < 
X*. As y* is an accumulation point for H, then #Hn [0, (5j] is a.s. infinite. In words there are 
infinitely many points of H smaller than (5j. Hence 

lim H[n] < 5i < x*. (4.3) 

n— j-oo 

The inequality in (|4.3|) holds for each z, yielding (|4.2|) . Moreover, (|4.3|) implies that each 
group will end up having finite cardinality. This is because #Hj n [0,5i], as 5i is strictly 
less than x* which is the only accumulation point for Hj. The latter statement is a direct 
consequence of the definitions of x* and H j . 

On the other hand, if y* = x^ for some i, then using again that all the x* which are finite 
are also a.s. distinct, we have that lim^^oo '^VA = 3;|. To prove the latter inequality, suppose 
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it is not true, i.e. lim„_^ooSM < x*. Then there would be an accumulation point smaller 
than y*, which would yield a contradiction. 

Next we analyse the general case, i.e. s„ < p, for some p < 1 and all n > 1. The problem 
here is that the reinforcement function is group dependent. In the special case s„ = p we 
had that the first point labelled i was H*[0]. We need to translate the points labelled i in 
the new construction for the general case. Denote by v{i) the time when the i-th group is 
created and denote by Tj_i the union of the points labelled j, with j < i — 1- We have that 
the first point labelled i is exactly Tj_i[t;(i)]. Set 

U* =^ {E*[n]-Em + '^^-Mi)]■■n>0}, 

u* '^ x*-Em + ^^-lW)]■ (4.4) 

Hence Tj = Tj_i U U* . Moreover, let T = U^^^Tj. It is easy to check that T embeds 
GAM({/j}, {s„}). We prove the theorem on the event {u* is a.s. finite for at least one 
created group j}. Repeating the argument we gave for the case Sn = p, we see that either all 
the groups remains finite or there exists exactly one dominating the others. □ 

Proof of Theorem 11.41 ii). First assume that s„ = p, for some p < 1. Under the 
assumptions of this part of the Theorem, we have that each x*- = oo, a.s.. Hence inf ,■ x*, = oo. 
By our construction, either \\ra.n-^^'r.[n] = oo, in which case the cardinality of each group 
is a.s. diverging to infinity, or lim„_).oo H[n] = F < c«, a.s., in which case #H„ Pi [0, F] < oo, 
a.s.. In words, in the latter case, the cardinality of each group will eventually remain finite, 
for otherwise x* < F < oo for some j, and this would give a contradiction. 

For general s„ < p, we have that n| = oo, where the u* are the random variables defined 
in (|4.4p . Reasoning as in the previous paragraph we get the result for this more general case. 
D 

4.1 An example when the third phase occurs 

Next we show an example where a third phase occurs, i.e. 

lim Ai{n) < oo, a.s. for each z > 1. (4.5) 

n— >oo 

In this example we pick fj{n) = eS^ +") and Sn = p^ (0, 1). Notice that Tj+i — Tj, with i > 1, 
is an i.i.d. sequence of geometrically distributed random variables, with mean 1/p. Hence, 
by a standard exponential bound, we have 



n 

»(t„ > ((1/p) - e)n) = F(^(r, - r,_i) > ((1/p) - e)n) < e" 



-Cn 
=1 

This implies that 

oo 

^P(r„ >n2) <oo. (4.6) 

n=l 
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Next, we use this fact to prove that 

for each j > I there exists an s > j such that x* < x*,. (4-7) 

The latter imphes that infj x^ is not attained. As this infimum is an accumulation point 
for H, this would imply that the smallest accumulation point of S is smaller than x*, for all 
j > I. Hence, (|4.5|) would hold. 

Next we turn to the proof of ()4.7p . Fix j G N. As Ej C H, we have that E[tu] < Ej[tu]. 
Hence, 



J 
=1 



(4.8) 



oo 

< 



1=1 

oo CO 



oo 

< 



The last inequality in ()4.8p is justified as follows. For any pair of events A and i? we have 
that 

P(^) = P(^ I B)F{B) + F{A n S^) < P(^ I B) + FiB"). 



Notice that {r^ < u^} is measurable with respect to the u-algebra 

crlRf^: t < u and i < u^\. 



In words, if we know the first v? — 1 observations of each Hj, with t < u, and the associated 
Bernoullis, we know if the event {r^ < u'^} holds. Hence the latter event is independent of 
the pair 



Hence the last expression in (|4.8p equals 

oo oo 

'Ywt'/fu{Nu{i)) > J2 W^^'/f,{N,{e))) +P(r„ > 



u'] 



e=u^ 



The last expression is summable. To see this, in virtue of (|4.6p . we just need to prove that 
the first term is summable. Then our argument follows from an application of the first Borel- 
Cantelli lemma. In fact, the summability implies that {x* < x^} for infinitely many u. Set 
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7nj = (l/j)e ■^ " ) and recall that j is fixed. For any pair of random variables X and Y 
and any constant a, we have that 

P(X > y) = F{X >Y,X >a) + F{X >Y,X <a)< F{X >Y,X >a) + F{Y < a) 

<F{X >a)+F{Y <a). 

We apply this fact to obtain 

oo oo 

oo oo 

< F[Y,Wt'/fu{N^{i)) > luj)+P{ J2 W^^^/MN,{i)) < 7„,. 
Notice that 



oo oo 

1 1^ ^1 1 1'^ 



Y, l//4n) = Y «"" "" = ^"" E «"" = ^1^"" ' 

while, by a similar reasoning, 'Ylin=u'^ ^/fji^) ~ ^^26"" "-^ • Next we reason like in the proof 
of (IZTTD . with min„>fce"+"^ = e+'=+"'. We have 

oo 

Ywt'/fu{Nu{i)) > ju,j) < exp{-(l -p)2(ei+-')(7„,,- - de""')}, 
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and the right hand-side is summable in u for fixed j. This is because for fixed j, juj > C'le 
for all sufficiently large u. In a similar way, using Chebyshev inequality after applying the 
function e^^ to both sides and choosing 6 = (1 — p)'^e^ ^^ , we obtain 



E W^''/MNm < luj) < exp{-(e- ^^ ){C,e-^ -^ -7.,,}. 

2 "3 

The last expression is summable in u, because, for fixed j, C2e~" ~^ is larger than 7^^ for 
all sufficiently large u. 

5 Brownian motion embedding 

Suppose that the positive function / satisfies the condition ^1^=1 ^/ fiJ) < ^jo. Consider 
an urn with k white balls and 1 red one. We pick a ball at random, and it is white with 
probability f(k)/{f{k) + /(I)). Suppose that by the time of the n-th extraction we picked j 
white balls and n — j red ones. The probability to pick a white ball at the next stage becomes 
f{k + j)/{f{k + j) + f{n + l-j)). Let 

def 

D = {only a finite number of white balls are picked}, 
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Denote by P'''' the probability measures referring to the urn with initial conditions and 
dynamics described above. 

Let F = J27Li ^/fiJ) &iid recall that F^ = Yl'T=k ^/ fU) ■ ^^^ ^^^ process B := {Bt, t > 
0} be a standard Brownian motion, which starts from the point F — F^ = '^^Zi l/f{i). 
Denote by Q^*"' the measure associated with this Brownian motion. We use this process to 
generate the urn sequence described at the beginning of this section, as follows. Set itiq = 
and let 

m/= inf |t >0:Bt hits either 1/fik) or - 1//(1)|. (5.1) 

If B„i-^ — Brrnf > then set zi = 1, otherwise set zi = 0. 

Suppose we defined m„ and zi,Z2, ■ ■ ■ , Zn- Set 0(n) = ^27=1 -^j- ^^ the event (j){n) = s, we 
define 

m„+i = inf <^ t > m„ : Bt - Bm„ hits either — — — - or - — — - 

I f[s + k) /(n-s + 1) 

Set 

( 1 ifB^„^,-B^^ = l/f{k + s) 

def I 

Zn+1 — \ 

( if B^^^, - B„,^ = -1/fin -s + 1) 

By the ruin problem for Brownian motion, we have that 

l/f{s + k) /(„_s + i) 



P {Zn+l = I 4>{n) 



' (l//(5 + fe)) + (l//(n-s + l)) /(c, + fc) + /(n-s + l)' 
which is exactly the urn transition probability. 

In this way we embedded the urn into Brownian motion. In fact, the process 0(ra), with 
n > 1, is distributed like the number of white balls withdrawn from the urn associated to the 
reinforcement scheme described at the beginning of this section. Notice that 

k+(^{ri) n—(j>{n) 

Bm^= Y. (V/(J))- E (V/(s)), withn>0. (5.2) 



Define 



S = lim nin- (5-3) 



This limit exists because the sequence of stopping times {nin} is increasing. For this reason 
S is itself a stopping time. Define 



fe+n oo 



Di = {3n>l:Bs = Y.{l/f{j)) -Y.il/ f{j))} = {Bs<0}, 

oo n 

D, t/ {3n>l:i?5 = E(V/(j))-E(V/(j))} = {i?5>0}. 
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Moreover, in virtue of Theorem 17.11 we have that exactly one of the collection of events 
|{zj = 0},i > 1} and {{zj = l},i > l} holds finitely many times, a.s.. This implies that the 
event Di U D2 holds Q**'-a.s.. By our embedding, we have that 

where D was defined at the beginning of this section. 

Proof of Theorem 11.51 In order to prove our result we only need to prove the following 

fc-i 

~+f(.s)Fk- 



- ^ 1^- 2ii 1+ f(s)^ 



s=l 

Let 

T=^mi[n>l:cl,{n) = '^—^y (5.4) 

This stopping time can be infinite with positive probability. Notice that on {T < 00}, by 
()5.2p . we have that the urn generated by the Brownian motion contains, at time T, an equal 
number of white and red balls, and Bmj, = 0. Viceversa, if we let 

H'^= mi{t>0: Bt = 0}, 

then we have that 

{H <S} = {T < 00}. (5.5) 

To prove ()5.5p . notice that for /c G N, with A; > 0, the random sequence 

k+<f>(n) n—(f){n) 

cannot switch sign without becoming 0. So if Bm > and B^t < 0, for some j < t, then 
there exists an s, with j < s < t, such that Bm^ = 0. In this case, by time s we have a tie. 
We use this fact throughout the proof. 

Recall that under Q'*"' the Brownian motion B starts from F — F^. For j < k, let 

Hj =^ inf{t >0: Bt = Fj- Fk}. 

Notice that Fj — Fk > for j < k. Moreover, by time Hj, with j < k — 1, on the event 
{Hj < S}, at least j red balls have been extracted. To see this, we first focus on Hi, and 
prove that by this time, on the event {Hi < S}, at least one red ball has been picked. 
Suppose that this is not true, i.e. suppose that we picked red balls by time Hi. The reader 
can check from our embedding that this implies that 

min Bt>F-Fk- I//1 = Fi - Fk- 
0<t<S 
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This would imply that Hi > S contradicting our hypothesis. By reiterating the same rea- 
soning we get that the statement holds true for any j < k. 

Define 



M, = < after time -f^j-i, the process B reaches Fj before it hits Fj — Ff^V. 



^j 



On Mj the Brownian motion, after time Hj-i, will hit Fj before there is a tie in the urn, 
because Fj — F^ > 0, for j < k. Next we prove that for any j £ {1,2, ... ,k}, if Mj holds 
then only a finite number of red balls are extracted, i.e. Mj C D2. We split this proof into 
two parts, we first prove that Mj Ci {S < Hj^i} C D2 and then Mj n {5 > Hj^i} C D2. 
In order to prove the first inclusion, recall that under Q'*"' the Brownian motion starts at 
F — Fk. This implies that if S* < Hj^i, then infinitely many balls will be extracted before the 
Brownian motion hits Fj-i — F^. As F — F^ > Fj-i — F^ > 0, we have that infinitely many 
balls will be extracted before B hits 0, i.e. before a tie. This implies that Bs > 0, which in 
turn implies Mj D {S < Hj^i} C D2. 

Next we prove that Mj n {5 > i?j-i} C D2. On the set Mj D {S > Hj^i}, by time Hj_i 
the number of red balls extracted is at least j — 1- This implies that 

00 i-i 

Bs < Y^il/fij)) - E(V/(i)) = F„ yk > n. (5.6) 

j=i t=i 

This is a consequence of (|5.2|) and the fact that n — (j){n) is a non-decreasing random sequence, 
and if n — (j){n) = j — 1 for some n then lim„^oo n — <P{n) > J — 1. Let 

def 

Vi = inf{m„: m„ > Hj^i, and B^^ - B.m,^_^ > 0}, 

i.e. the first time after -f/j-i that a white ball is extracted. The stopping time Vi could be 
infinite. Next we prove that on Mj the random time Vi is a.s. finite. Recall that Hj-i is the 
first time that the process B hits Fj-i — F^, and that < -Fj-i — F^ < F — F^. This implies 
that by time -ffj-i the number of white balls generated by the Brownian motion, plus the 
initial k, overcomes that of the red ones. On Mj, after time -f^j-i, the process will hit Fj 
before it hits 0. This implies that 14 < 00 a.s. on Mj. In fact if no white balls are extracted 
after time Hj-i the process would hit before it hits Fj giving a contradiction. Moreover on 
Mj, we have that Bvi > 0, hence by time Vi the white balls are still ahead with respect the 
red ones. We can repeat the same reasoning with 

def 

V2 = inf{m„: m„ > Vi, and Bm^ - B.,n„_-^ > 0}, 

to argue that V2 is a.s. finite and by time V2 the white balls are still in advantage. By 
reiterating this argument we get that only finite many red balls will be extracted, because 
each Vi occurs before a tie, a.s.. Hence D2 holds when Mj holds. This implies that D2 C M? 
for each j £ {1,2, . . . ,k-l}. If n5^=iM| holds, then either {Bs > 0} holds or {H < S} holds. 
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If the latter event holds, independently of the past, the probability that only finitely many 
white balls are picked is exactly 1/2, by symmetry. Moreover, the events Mj are independent, 
because they are determined by the behavior of disjoint increments of the Brownian motion. 
By the standard ruin problem for this process, we have that 



«'"(M.) - rr^- (") 



We get 



''''(d^) = ^'''(d'^)^III^-ttmf^ 






6 Proofs of Theorem 11.61 and Corollary 11.7 



Proof of Theorem ll.61 Notice that Lead must be a vertex of 71. Under the assumptions 
of the Theorem, the probability that inf j x^ > M is smaller or equal to the probability that 
xl > M. The latter probability is bounded as follows 

oo 

^E^f //(A^i(j)) >M)< exp{-{l-pfai{M-3F)}. 

i=i 

We set Ci = exp{3(l —p)'^F} and C2 = (1 — p)^ai, where ai = inffc>i/(fc). In virtue of 
(|2.25p . the probability that all the vertices at level n are good is at least 

1 - m" inf e-'="('^'^)" + r"'^, 

where c„ (r, M) were introduced at the end of the proof of Lemma 12.21 and m was introduced 
in (j2.2Up . Recall that gn is the set of the vertices of G at level n. Moreover, recall that 
Gn = ^j>n9j- We have 

P(Lead G G„) < P({inf x* > M} U {at least one vertex in gn is not good}) 

r>l 

n 

Proof of Corollary II. 7L Set i(l) = T2 and define recursively i{n) = inf{j > i(n — 
1): i?'-^' = 1}. Notice that i{k) > k. If a vertex v of Q belongs to gi then we have that 
Ti, = i{k) for some k. We have 

P(Lead = !)>!- E[ J^ t{^*<.i}] - P(Lead G G2). 
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We bound the last probability in the previous expression using Theorem 11.61 Order the 
groups at level one, starting from the smaller. As i{k) > k, we have that by the time the k-th 
group at level 1 is created, there are at least k balls in urn 1. Hence, using Theorem II. 5| we 
get 



fc-i 



jegi fc=i e=i ^ -^ w ft 



D 



7 Appendix 

Fix two real numbers r and w, and two sequences of positive real numbers {W{k),k > w} 

and {R{i),i > r}. Suppose we have an urn with w (resp r) white (resp. red) balls. If at step 

n > there are exactly j white balls, with n — j>0>w — j, then the probability to pick a 

white ball is 

W{j) 

W{j) + R{n -j + w)' 

If a white (resp. red) ball is picked, at time n + 1 the composition of the urn becomes j + 1 
(resp. j) white balls and n — j + w (resp. n — j + w + 1) red ones. Denote by 

def 

^R — { the number of red balls in the urn goes to cxd as n — t- oo}, 

def 

A]y = { the number of red balls in the urn goes to cxd as n — t- oo}. 

Let Q be the measure describing the dynamics of this urn. We have the following result, due 
to Herman Rubin (see the Appendix in [7j) 

Theorem 7.1 [H. Rubin] We have the following 3 cases 

''') V SfcLu, (^(^)) = oo and YlT=r i-^i^)) — °^' then both the number or red balls 
and the number of white balls in the urn goes to oo, a.s., as n ^ oo. 



n) ^/Er=«, {W{k)) '<oo and EZr (^(^)) ' = oo, then 

QiAn) = 1. 

^^^) IfEZn^ {W{k)y^ < oo and Er=r (^(^))"' < oo, then 

Q(Ah) + Q{Aw) = Q{Ar u Aw) = 1, 
and both Q{Aji) and Q{A\y) are strictly positive. 
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